Markov decision process

Results: 537



#Item
91Towards	
  a	
  Principled	
  General	
  Approach	
  to	
  	
   Motion	
  Planning	
  under	
  Uncertainty	
  	
   David	
  Hsu	
   National	
  University	
  of	
  Singapore	
    	
  

Towards  a  Principled  General  Approach  to     Motion  Planning  under  Uncertainty     David  Hsu   National  University  of  Singapore    

Add to Reading List

Source URL: www.cse.unr.edu

Language: English - Date: 2012-12-20 19:29:32
92Learning Anticipation Policies for Robot Table Tennis Zhikun Wang1,2 , Christoph H. Lampert3 , Katharina Mülling1,2 , Bernhard Schölkopf1 , Jan Peters1,2 Abstract— Playing table tennis is a difficult task for robots,

Learning Anticipation Policies for Robot Table Tennis Zhikun Wang1,2 , Christoph H. Lampert3 , Katharina Mülling1,2 , Bernhard Schölkopf1 , Jan Peters1,2 Abstract— Playing table tennis is a difficult task for robots,

Add to Reading List

Source URL: www.is.tuebingen.mpg.de

Language: English - Date: 2011-10-10 12:02:25
93Addressing the Policy-bias of Q-learning by Repeating Updates ∗ Sherief Abdallah

Addressing the Policy-bias of Q-learning by Repeating Updates ∗ Sherief Abdallah

Add to Reading List

Source URL: michaelkaisers.com

Language: English - Date: 2013-05-22 14:31:54
94Stochastic and fluid index policies for resource allocation problems M. Larran˜aga1,2,5 , U. Ayesta2,3,4,5 , I.M. Verloop1,5 IRIT, 2 rue C. Carmichel, FToulouse, France. 2 CNRS, LAAS, 7 avenue du colonel Roche, F

Stochastic and fluid index policies for resource allocation problems M. Larran˜aga1,2,5 , U. Ayesta2,3,4,5 , I.M. Verloop1,5 IRIT, 2 rue C. Carmichel, FToulouse, France. 2 CNRS, LAAS, 7 avenue du colonel Roche, F

Add to Reading List

Source URL: verloop.perso.enseeiht.fr

Language: English - Date: 2015-04-01 14:42:52
95Reusable Sampling-Based Techniques for Manipulation via Pushing Christopher Vo Abstract—In this work, we consider the problem of manipulating a polygonal object through an obstacle-filled environment using only push in

Reusable Sampling-Based Techniques for Manipulation via Pushing Christopher Vo Abstract—In this work, we consider the problem of manipulating a polygonal object through an obstacle-filled environment using only push in

Add to Reading List

Source URL: www.cse.unr.edu

Language: English - Date: 2012-12-20 19:29:59
96Using Iterated Reasoning to Predict Opponent Strategies Michael Wunder Michael Kaisers  Rutgers University

Using Iterated Reasoning to Predict Opponent Strategies Michael Wunder Michael Kaisers Rutgers University

Add to Reading List

Source URL: michaelkaisers.com

Language: English - Date: 2012-04-29 08:04:26
97Shared Autonomy via Hindsight Optimization Shervin Javdani, Siddhartha S. Srinivasa, J. Andrew Bagnell The Robotics Institute, Carnegie Mellon University {sjavdani, siddh, dbagnell}@cs.cmu.edu Abstract—In shared autono

Shared Autonomy via Hindsight Optimization Shervin Javdani, Siddhartha S. Srinivasa, J. Andrew Bagnell The Robotics Institute, Carnegie Mellon University {sjavdani, siddh, dbagnell}@cs.cmu.edu Abstract—In shared autono

Add to Reading List

Source URL: www.ri.cmu.edu

Language: English - Date: 2015-05-06 09:31:40
98Reusable Sampling-Based Techniques for Manipulation via Pushing Christopher Vo Abstract—In this work, we consider the problem of manipulating a polygonal object through an obstacle-filled environment using only push in

Reusable Sampling-Based Techniques for Manipulation via Pushing Christopher Vo Abstract—In this work, we consider the problem of manipulating a polygonal object through an obstacle-filled environment using only push in

Add to Reading List

Source URL: iros2011.org

Language: English - Date: 2011-09-21 07:00:45
99Stochastic and fluid index policies for resource allocation problems M. Larran˜aga1,2,5 , U. Ayesta2,3,4,5 , I.M. Verloop1,5 IRIT, 2 rue C. Carmichel, FToulouse, France. 2 CNRS, LAAS, 7 avenue du colonel Roche, F

Stochastic and fluid index policies for resource allocation problems M. Larran˜aga1,2,5 , U. Ayesta2,3,4,5 , I.M. Verloop1,5 IRIT, 2 rue C. Carmichel, FToulouse, France. 2 CNRS, LAAS, 7 avenue du colonel Roche, F

Add to Reading List

Source URL: homepages.laas.fr

Language: English - Date: 2015-02-17 15:01:29
100Asymptotically optimal index policies for an abandonment queue with convex holding cost∗ M. Larra˜ naga1,2,5 , U. Ayesta2,3,4,5 , I.M. Verloop1,5 1 CNRS, IRIT, 2 rue C. Carmichel, FToulouse, France.

Asymptotically optimal index policies for an abandonment queue with convex holding cost∗ M. Larra˜ naga1,2,5 , U. Ayesta2,3,4,5 , I.M. Verloop1,5 1 CNRS, IRIT, 2 rue C. Carmichel, FToulouse, France.

Add to Reading List

Source URL: homepages.laas.fr

Language: English - Date: 2015-04-15 10:57:22